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Reconstructing the free energy landscape of a polyprotein by 
single-molecule experiments 
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PACS 82 . 37 . Rs - Single molecule manipulation of proteins and other biological molecules 

pacs 87. 15. La - Mechanical properties 

PACS 85.78.Ln - Nonequilibrium and irreversible thermodynamics 

Abstract. - The mechanical unfolding of an engineered protein composed of eight domains of Ig27 is in- 
vestigated by using atomic force microscopy. Exploiting a fluctuation relation, the equilibrium free energy as 
a function of the molecule elongation is estimated from pulling experiments. Such a free energy exhibits a 
regular shape that sets a typical unfolding length at zero force of the order of 20 nm. This length scale turns 
out to be much larger than the kinetic unfolding length that is also estimated by analyzing the typical rupture 
force of the molecule under dynamic loading. 
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Force spectroscopy techniques have enormously increased 
our knowledge on the structure of biopolymers such as pro- 
^ teins and nucleic acids. The possibility of controlling very pre- 
■ cisely the force applied to the free ends of such molecules al- 
' lowed the experimental evaluation of the typical lengths and 
energies of the bonds stabilizing the molecular structures [1-9]. 
' The mechanical unfolding of a protein is typically an out- 
of-equilibrium process, where the unfolding occurs in time 
scales much shorter than the typical molecule relaxation time: 
i this prevents the possibility of performing the experiments in 
quasiequilibrium conditions and thus of obtaining direct mea- 
surements of thermodynamic variables. However, this prob- 
lem can be overcome by using the remarkable equality de- 
rived by Jarzynski [10], which allows one to measure the free 
energy difference between the folded and the unfolded state 
of a biomolecule [11]. By exploiting an extended version of 
the Jarzynski equality (JE) the free energy landscape of model 
biopolymers as a function of the molecular elongation has been 
evaluated [12-18]. Furthermore in a recent experimental work 
[19], the free energy landscape of a molecule similar to the one 
we use here has been reconstructed, in a range of the molecular 
elongation corresponding the unfolding of a single domain (see 
below for a discussion of the protein structure). Usually, the 
information concerning the landscape of a protein, obtained by 
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single-molecule experiments, has been limited to the position 
and to the height of the energy baiTier along the reaction co- 
ordinate, which is usually the molecule elongation. The actual 
shape of the landscape can be only guessed by adapting it to the 
information about the kinetics gathered during the unzipping 
experiments, see e.g., [20]. However, in a recent paper [16] 
it has been argued that the unfolding length and the unfolding 
rate that are obtained by investigating the kinetics of protein un- 
folding, are not simply related to the position and height of free 
energy baiTiers as given by the extended JE. In other words, 
the kinetic parameters governing the protein unfolding may not 
coiTespond to the equilibrium thermodynamic al properties of 
the molecules. 

The aim of this paper is thus to show that it is possible to 
reconstruct the free energy landscape (EEL) of a protein for any 
value of its elongation. Furthermore, we aim to compare the 
typical length scale of this landscape with the unfolding length 
measured in kinetic experiments, and to discuss the meaning of 
this parameter in the context of the equilibrium properties of 
proteins. 

Unfolding of large proteins is a very rare and slow event, 
but by using single-molecule techniques it is possible to drive 
mechanical unfolding and make it feasible to study the unfold- 
ing kinetics of these molecules. The molecule investigated in 
this work is a recombinant polyprotein composed of eight re- 
peats of the Ig27 domain of human titin in PBS buffer (I270TM, 
Athena Enzyme Systems (TM) Baltimore, MD, Cat. No.0304). 
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The proteins were allowed to absorb onto a gold substrate, 
and any free protein was removed by washing the sample off. 
In single-molecule force spectroscopy experiments exploiting 
atomic force microscope (AFM), the molecule deposited onto 
substrate is grabbed by the AFM probe. Mechanical unfolding 
of the protein is thus induced by moving the probe away from 
the substrate with a constant velocity v (linear protocol). As the 
probe is retracted, the force exerted on the molecule increases 
until the molecule suddenly unfolds. If the molecule is com- 
posed of multiple domains, as in the present paper, further re- 
traction causes the extension of successive folded domains, un- 
til all the domains are unfolded and the protein-tip interaction 
is broken. The corresponding force-distance curve has a typical 
sawtooth structure, as shown in fig. 1. The probe of an AFM 
behaves as an harmonic spring within a wide range of values of 
the deformation with respect to the equilibrium position, and 
thus the force exerted on the molecule is / = k{z{t) - €) where 
k is the spring constant, { is the elongation of the molecule and 
z{f) = vt is the actual position of the probe with respect to the 
substrate (see fig. 1). In the present work, mechanical unfold- 
ing experiments are performed with velocities v ranging from 
200 to 2000 nm /s at room temperature. 
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Fig. 1: Top panel: Cartoon of the protein attached to the tip on an 
AFM. Bottom panel: Force extension curve of the polyprotein com- 
posed of eight repeats of the Ig27 domain. The typical distance be- 
tween two consecutive peaks ranges between 22 and 26 nm. 

Let us now consider the case where a molecular bond, or 
a group of molecular bonds are subject to a force increasing 
linearly with the time / = rt, where r is the force increase rate. 



It can be shown that the rupture force is distributed according 
to the probability distribution function [21] 



P{f) = — e^^^" exp 
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and the typical rupture force, defined as the maximum of P{f), 
is given by 

/* = — ln|;6rx„To], (2) 

where x„ is the unfolding length, usually interpreted as the de- 
formation after which the molecule unfolds, tq is the typical 
unfolding time at zero force, andyS - l/iksT). Since we use a 
rather soft spring (k - 0.04 N/m), we take the rate of increase of 
the force in eqs. (l)-(2) to be r = kv. By plotting the force as a 
function of the time, we verified that this is a good approxima- 
tion for the force rate (data not shown). Because the AFM tip 
can bind any module of the protein at random, one is not able 
to control the site from which the protein is picked up [6, 7]. 
Therefore, we obtained a random sample of single molecule 
unfolding trajectories containing from one to eight unfolding 
events. In figure 2 we plot the typical rupture force /* of a 
single Ig27 module as a function of the AFM probe velocity 
V, where trajectories with at least three repeats (i.e. rupture 
events) are considered. By fitting the data to eq. (2) we obtain 
an estimate of the unfolding length x„ = 0.30 + 0.07 nm, which 
is in good agreement with the value x„ = 0.25 nm found in [2]. 
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Fig. 2: Rupture force /* of a single Ig27 domain as a function of the 
AFM probe velocity v, the line is a fit to eq. (2), the number of single 
domain rupture ranges between 104 and 751 (v = 200nm/s — > 751 
single rupture events, v = 400nm/s -^ 367, v = 600nm/s -^ 209, 
V = 800nm/s -^ 286, v = lOOOnm/s -^ 104, v = 2000nm/s -^ 351). 
Inset: Histogram of the rupture force for v = 200 nm/s, the line is a fit 
toeq. (1). 

We now focus on the evaluation of the free energy landscape 
F()({), which is a function of the molecular elongation {. As 
discussed in ref. [12], such an energy landscape can be obtained 
by exploiting an extend version of the Jarzynski equality 



Q^-m^^ ^W,m) ^ exp [-liF,{{)] IZ,, (3) 
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Fig. 3: Free energy landscape Fo as a function of the molecular 
elongation t, as given by eq. (3) for different values probe velocity, 
V = 200, 400, 2000 nm/s: thenumberof unfolding trajectories consid- 
ered are 66, 35, 29, respectively. The error bars are obtained by using 
the jackknife approach [22] for data resampling: for each value of v 
subsamples of the total number of trajectories are considered, and for 
each subsample the free energy landscape is evaluated, as discussed 
in the text. One obtains thus a set of curves Fq({), and the error bars 
are obtained, for each value of {, as the standard deviation of the mean 
with respect to this set. The bars are mutually shifted for clarity's sake. 
Inset: Plot of F in the small £ range. 



where W is the work done on the molecule during the unfolding 
process, Zq is the partition function of the unperturbed system 
(folded molecule with / = 0), and U{(,z{t)) = /t/2(z(f) - (f 
is the external potential associated to the tip of the AFM. Note 
that, when one takes the logarithm of the rhs of eq. (3), the 
partition function appears in the additive constant - log Zq, and 
thus it plays no role in the setting of Fo({). In the following, 
we do not consider all the unfolding trajectories that we used 
to obtain the data in fig. (2), but we take only those trajecto- 
ries with at least six rupture events. For each of these trajec- 
tories, we compute the work W done on the molecule by the 
external force f{t) - k{z{t) - iif)). Practical procedures for 
obtaining the optimal estimate of FqH) from eq. (3) are dis- 
cussed in refs. [12-14]. However, as the value of the work W 
exceeds few hundreds of ksT, eq. (3) gives unreliable results, 
since in that case the average is estimated by considering very 
small numbers (exp(-/3W) <s: 1). In order to avoid this nu- 
merical problem, we calculate here the average of the quantity 
exp(-j8W + A), where A is a fixed quantity, so as eq. (3) pro- 
vides an estimate of exp [-y6(F()(L) + A)]. By taking different 
values for A one selects ranges of values of the work W such 
that W -H A is not larger than a few hundreds of kBT, and thus 
one can reconstruct the free energy landscape Fq{() piecewise. 
In particular we take A = 0, 1000, 2000, 3000, 4000/^^7. 

In refs. [14, 15] it was argued that eq. (3) supplies a reli- 
able estimate of the molecule free energy, provided that, as the 
pulling rate is decreased, the estimated curves Fq{() collapse 
onto a single curve. In figure 3, the free energy landscape Fo{l) 
as reconstructed from eq. (3) is plotted for different values of 
the AFM tip velocity v. We actually observe a collapse of the 



curves for the two smallest velocities, while the agreement with 
the curve obtained with the largest value of v = 2000 nm/s 
worsens as the coordinate t increases. 
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Fig. 4: Free energy landscape F(i,f) = F(,(£) - ft for three values 
of the force, / = 0, 50, 100 pN. The function Fo(^) corresponds to the 
most reliable estimate of Fo{C) we obtained, i.e. the one with v = 200 
nm/s. 

The energy landscape Fq(1) exhibits a single minimum at 
t - 0, corresponding to the molecule in the folded native 
state. However the curve Fo(^) exhibits cusps which are eq- 
uispaced at a distance of M ^ 20 nm. As a constant force 
/ is applied, the free energy landscape is tilted according to 
FiiJ) = Fi){t) - f€. In figure 4 we plot the FEL, as obtained 
with the smallest velocity here considered, for different values 
of the force /. As the external force / is applied, the landscape 
F{(, f) becomes more intricate: the cusps become local max- 
ima, delimiting equispaced minima. The overcoming of each 
such cusp corresponds thus to the breaking of a single Ig27 
domain. For each value of the force, a well defined global min- 
imum appears in the function F(f, /), therefore F({, f) predicts 
how many domains out of the eight will be unfolded at equilib- 
rium for that value of the force. As an example, we find that the 
second minimum becomes deeper that the first one as / > 50 
pN, see inset of fig. 4. Thus, we expect that in a force clamp ex- 
periment, where a feedback system allows to apply a constant 
force to the molecule free ends [7,8], this would be the constant 
force required to unfold a single Ig27 domain. In a different ex- 
perimental set-up, where force-ramp pulling was used, the first 
Ig27 domain was found to unfold at a typical force f - 15 pN 
(see fig. 5b of ref. [5]): such a value is consistent with / ^ 50 
pN predicted by our results. 

We want to stress that at zero force the distance between the 
global minimum { - Q and the first cusp is Mif = 0) - 20 
nm, while at / = 50 pN the distance between the first min- 
imum and the following maximum is M(f - 50) ^ 6 nm. 
These lengths are one order of magnitude larger than the ki- 
netic unfolding length x„ = 0.25 nm found in [2] and in the 
present work. Thus as discussed in ref [16], the quantity Xi, ap- 
pearing in eq. (2) represents a kinetic parameter, and does not 
correspond to the typical length of the free energy landscape: 
at zero force the thermal fluctuations have to induce a deforma- 
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tion M, - 20nm » x„ before the molecule unfolds, while at 
constant force / = 50 pN this deformation has to be A€ = 6 
nm. 

Let us now discuss the reliability of the reconstruction of 
Fo(^) over the whole range of values of I. As the AFM probe 
is retracted and the molecule progressively unfolds, the sys- 
tem finds itself farther and farther from equilibrium. Since the 
Jarzynski equality requires that all the trajectories start from the 
equilibrium condition [10], we are more confident on the relia- 
bility of the reconstruction of ¥{){€) in the small-to-moderate I 
range, than in the large I range. However, inspection of fig. 4 
indicates that each of the eight energy wells exhibits an almost 
identical shape. In fact the distance between two successive 
cusps is constant, and at / > the height of each relative max- 
imum with respect to the respective energy minimum is also 
constant, within a few ksT-s. 

We now compare our results with those of ref. [19], where 
the PEL of a similar molecule was investigated. In that paper 
the reconstruction of the PEL was limited to /' < 30 nm, i.e. to 
the rupture of a single Ig27 domain, while here we manage to 
reconstruct the PEL for the whole range of the molecular ex- 
tension, as discussed above. In ref. [19] the minimal velocity 
used is V = 50 nm/s, which is one fourth of the smallest ve- 
locity used here v = 200 nm/s. However for this velocity, we 
find that at the first cusp (i* ^ 20 nm) the free energy takes the 
value Fo(£*) ^ 200 kgT (see inset of fig. 3) which is slightly 
larger than the value found in that paper Fq(£ ^ 20) ^167 ksT. 
This result indicates that although we use a larger velocity com- 
pared to ref. [19], we obtain a reliable estimate of the PEL for 
the polyprotein considered, and using a smaller velocity would 
only give negligible corrections to our estimate. 

We find that, at v = 200 nm/s, the average value of the 
work performed during the unfolding of a single Ig27 domain 
is {W*} ^ 405 hgT, where W* corresponds to the value of the 
work, when the position of the probe is z* = {* - 20 nm. 
The average work to unfold the successive Ig27 domains takes 
the same value within statistical errors. This value has to be 
compared with the value of the energy landscape, Fq{{*) ^ 
200 ksT, i.e. for the value of the coordinate £ where a single 
domain is unfolded, see figure 3. Thus, we have {W*} > Fq{£*), 
and can conclude that the Jarzynski equality gives a better esti- 
mate of F(){£) than the classical measurement of {W}. 

In conclusion we have evaluated the free energy landscape of 
a polyprotein by combining atomic force microscopy and the 
JE. The landscape exhibits equispaced wells, each correspond- 
ing to a single domain of the protein. The typical length scale of 
this landscape is 20 nm, which is larger than the typical unfold- 
ing length X[, as obtained by analyzing the typical rupture force. 
We want to remark that in ref. [2] the length x„ - 0.25 nm was 
identified as the distance of the free energy barrier from the 
folded state. Our results suggest, on the contrary, that, at / = 0, 
the distance between the folded state and the first energy barrier 
is of the order of 20 nm. We attribute this difiference to the fact 
that the quantity x„ as obtained by eq. (2) is a kinetic parameter 
describing the unfolding kinetics of the Ig27 domain, while the 
extended JE (3) provides an equilibrium quantity, namely the 
equilibrium PEL. 



It is worth to note that as our approach is successful to re- 
construct the PEL of a protein made up of identical domains in 
the whole range of the molecule extension, we believe that the 
same approach can be successfully used to probe the PEL of 
heterogeneous proteins made up of different domains. 
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